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Abstract We address optimization of nonlinear functions of the form f{Wx) , where / : R'^ ^ R is a 
nonlinear function, W is a d x n matrix, and feasible x are in some large finite set J- of integer points 
in R" . Generally, such problems are intractable, so we obtain positive algorithmic results by looking 
at broad natural classes of / , and J- . 

One of our main motivations is multi-objective discrete optimization, where / trades off the linear 
functions given by the rows of W . Another motivation is that we want to extend as much as pos- 
sible the known results about polynomial-time linear optimization over trees, assignments, matroids, 
polymatroids, etc. to nonlinear optimization over such structures. 

We assume that the convex hull of J- is well-described by linear inequalities (i.e., we have an efhcient 
separation oracle). For example, the set of characteristic vectors of common bases of a pair of matroids 
on a common ground set satisfies this property for J- . In this setting, the problem is already known to 
be intractable (even for a single matroid), for general / (given by a comparison oracle), for (i) d = 1 
and binary-encoded W , and for (ii) d — n and W = I . 

Our main results (a few technicalities suppressed): 

1- When T is well described, / is convex (or even quasiconvex) , and W has a fixed number of rows 
and is unary encoded or with entries in a fixed set, we give an efficient deterministic algorithm for 
maximization. 

2- When J- is well described, / is a norm, and binary-encoded W is nonnegative, we give an efficient 
deterministic constant-approximation algorithm for maximization. 

3- When is well described, / is "ray concave" and non-decreasing, and W has a fixed number of 
rows and is unary encoded or with entries in a fixed set, we give an efficient deterministic constant- 
approximation algorithm for minimization. 

4- When T is the set of characteristic vectors of common bases of a pair of vectorial matroids on a 
common ground set, / is arbitrary, and W has a fixed number of rows and is unary encoded, we give 
an efficient randomized algorithm for optimization. 
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Introduction 

Generally, we are considering nonlinear discrete optimization problems of the form 
P(^,opt,/,W^) : opt {fiWx) : x e T} , 

where opt G {min, max} , W G Z'^^" is is a matrix of integers, function / : R** ^ R is given by a 
comparison oracle, and T is a large finite subset of Z" . 

One motivation for this setting is multiobjective optimization, where we seek an optimal "balanc- 
ing", via a nonlinear function / , of d competing linear objectives Wi.x . Another motivation is the 
desire to extend known results about polynomial-time linear optimization over combinatorial structures 
to nonlinear optimization over such structures. 

We assume that is "well described" — that is, T C S{n,(3) := {x G R" : 1^ x < (3} , for some 
unary encoded p G Z+ , and that we have a separation oracle for V :— conv(J^) . For example, the set 
of characteristic vectors of common bases of a pair of matroids on the common ground set {1,2, ... ,n} 
satisfies this property for J- (with p = n) . 

In this setting, the problem is already known to be intractable (even for a single matroid), for 
general / (given by a comparison oracle), for (i) d = 1 and binary-encoded W , and for (ii) d = n 
and W = I . Therefore, in much of our work, we assume that the objective vectors Wi. have a coarse 
encoding. This can take the form of (i) unary encoding, (ii) entries in a fixed set of binary-encoded 
numbers, or (iii) "generalized unary encoding" (a common generalization of (i) and (ii)). Furthermore, 
to obtain efficient algorithms, we often require that the number d of rows of W is fixed. 

We have four main results which below we describe and relate to the literature. Precise statements 
of the theorems are given in the later sections. 

(Theorem [TT) When J- is well described, f is convex (or even quasiconvex) , and W has a fixed 
number of rows and is unary encoded or with entries in a fixed set, we give an efficient deterministic 
algorithm for maximization. 

• As a very special case = {n' x n' permutation matrices} , W G Z''^" ^" unary encoded, / 
convex), we obtain a result of [3]. The special case in which T is the set of characteristic vectors 
of common independent sets or common bases of a pair of matroids on N :— {1,2, ... ,n} 
appears to be new (note that in such a case we may take (3 = n), even for / convex. A fortiori, 
also new is the special case in which J- is the set of integer points that are in a pair of integral 
polymatroids or associated base polytopes. Furthermore, the special case in which JF is the set 
of characteristic vectors of matchings of a nonbipartite graph appears to be new, even for / 
convex. 

(Theorem [2]) When T is well described, f is a norm, and binary- encoded W is nonnegative, we 
give an efficient deterministic constant- approximation algorithm for maximization. 

• As a very special case {J- = {n' x n' permutation matrices} , W G Z''^" ^" , / a p-norm), we 
obtain a result of |3^. 

(Theorem[3|) When T is well described, f is "ray concave" and non- decreasing, and W has a fixed 
number of rows and is unary encoded or with entries in a fixed set, we give an efficient deterministic 
constant- approximation algorithm for minimization. 

• This theorem generalizes a result of 3^ which looked at the very special case of: / G {p— norms} , 
T := {n' X n' permutation matrices} , W G Z''^" ^" unary encoded. In addition, our more 
detailed analysis of p-norms further generalizes the result of [3] pertaining to the 2-norm. 

(Theorem |4|) When T is the set of characteristic vectors of common bases of a pair of vectorial 
matroids on a common ground set, f is arbitrary, and W has a fixed number of rows and is unary 
encoded, we give an efficient randomized algorithm for optimization. 

• This theorem can be contrasted with a result in [2] which established a deterministic algorithm 
for the case of a single vectorial matroid. 

Note that all of our results extend to the generalization where we have a binary-encoded c G Z" 
giving a "primary" linear objective function x that we maximize, and, subject to that maximization, 
we seek an optimal balancing, via a nonlinear function / , of d competing linear objectives Wi.x : 



Pc(^,opt,/, W) 



opt { f{Wx) : a; G argmax {c^ii : x e J-}} 
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The reason is that we can find the optimal value z* of c x over J- using the ellipsoid method and 
the separation oracle for V = conv(J-') , and then the equation x — z* together with the separation 
oracle for V yields a separation oracle for the face Vc '■= P n {a; G M" : x = z*^ oiV . 

Relevant background material (including polytopes, matroids, polymatroids and hnear program- 
ming) can be found in [4ll9llllj . Additional relevant material on nonlinear discrete optimization can be 
found in [6117118] . 

In Sjl] we develop an algorithm for quasiconvex combinatorial maximization. Some of the elements 
that we develop for this are used in later sections as well. In i jl.li we introduce our "generalized 
unary encoding" scheme which we use for weight matrices. In §1.21 we investigate relevant properties 
of the fibers (i.e., inverse images) of points in the linear image of a polytope. In §1.31 we apply 
properties of fibers to give an efficient algorithm for quasiconvex combinatorial maximization. In Sj2l 
we consider approximation algorithms as a means of relaxing the assumptions that gave us an efficient 
algorithm in the previous section. We develop one algorithm for norms and another for a very broad 
generalization of norms. In §2.11 relaxing a requirement on the input weight matrix W ^ we give an 
efficient approximation algorithm for combinatorial maximization of norms / . In ^2.2[ we further 
exploit the geometry of fibers to give an efficient approximation algorithm for nonlinear combinatorial 
minimization, when W is nonnegative and / is non-decreasing and "ray-concave" on the nonnegative 
orthant. In ij3l for unary-encoded weight matrices W , we give an efficient algorithm for optimizing an 
arbitrary nonlinear function / , over the set T of characteristic vectors of common bases of a pair of 
vectorial matroids on a common ground set. 

1 Quasiconvex combinatorial maximization 

1.1 Generalized unary encoding of weights 

General binary encoding of weights is typically too parsimonious to allow for the construction of 
theoretically efficient algorithms for nonlinear discrete optimization. So we turn our attention to less 
parsimonious encodings. We consider weights Wij of the form 

p 
fc=i 

with integer p > 1 fixed, the distinct positive integers being binary encoded, but the integers 
(5f J (unrestricted in sign) being unary encoded. It is convenient to think of unary-encoded matrices 
5^ = ((S^ j)) e Z''^" , for 1 < fc < p , and binary encoded a e Zp , with > , for 1 < fc < p . Then 

we have W := ((IF,,,)) = ELi ^kS'' . 
We have the following special cases: 

1. Unary-encoded weights: With p ~ 1 and ai = 1 , we get the ordinary model of unary-encoded 

2. Binary-encoded {0, ai, a2, Opj-valued weights: With S'' > for all k , and ^ij — ^ ' 

for all i,j , we get the case of all Wi_j in the set {0, ai, 02, Cp} having binary-encoded elements . 

3. 0/1-valued weights: This is, of course, the important common special case of[T]and[2| 

Because [1] is a much more common generalization of [3| than is [2l we refer to our general setting 
as generalized unary encoding. We do note that there are cases where, with respect to a particular 
nonlinear combinatorial optimization problem, one algorithm is efficient for case [2| but not for case 
[U and for another algorithm vice versa (see [2l[5] for example). Furthermore, there are broad cases 
where with binary encoding a problem is provably intractable, but we will see that we can construct 
an efficient algorithm when the weights have a generalized unary encoding. 

1.2 Fibers 

We begin with some definitions. For W € Z''^" and polytope V C R" , we let WV denote the image 
of V under the linear map x ^ Wx . Clearly WV C M'* is also a polytope. For u gM.'^ , we define 

W-^u := {x e M" : Wx = u} . 
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For u G M'* , we refer to (W^^u) fl "P as the Wp-fiber of u . If and V are clear from context, then 
we may just say "the fiber of u ." For it G M'' , we note that u e WV if and only if the H/p-fiber of u 
is nonempty. 

First, we point out how we can optimize efficiently a linear function on the fiber of any u G M"^ . In 
particular, we can also determine nonemptiness of such a fiber. 

Lemma 1 Assume that we have a separation oracle for the polytope V C M" . Let c G Z" , W E Z"^^" 
and u G he binary encoded. Then we can solve the linear- objective optimization problem 

maxjc^a; : x E [W^'^u) nV] 

in polynomial time. In particular, we can test nonemptiness of the W-p -fiber of u in polynomial time. 

Proof The problem is just the linear program 

max jc^x : Wx = w , x G P} , 

which we can solve in polynomial time via the ellipsoid method, as we assume that we have available 
a separation oracle ior V . □ 

Unfortunately, even though V is assumed to have integer vertices (it is the convex hull of the 
finite set of integer points), the Wp-fiber of u may have some integer vertices and some fractional 
ones (after all, we are just intersecting V with a linear subspace). Therefore, an optimal extreme point 
solution of the linear program of Lemma[T]may be at a fractional vertex of the Wp-fiber of u . In short, 
fibers of arbitrary u G WP are not well behaved. However, we will see that fibers of m G vert{WV) 
are better behaved. Specifically, we will now demonstrate that if m G veTt{WV) , then we can find 
efficiently an integer optimizer of any linear function on the Wp-fiber of u . 

Lemma 2 Assume that we have a separation oracle for the polytope V C M" . Let c G Z" , W £ Z"*^" 
and u G vert(TyP) be binary encoded. Then we can solve the optimization problem 

maxjc^a; : x £ {W^'^u) C^V CM^') 

in polynomial time. 

Proof It is well known that (i) the image Q of a polytope V under a linear (even affine) map is a 
polytope, and (ii) the preimage (in V) of every face of Q is a face of P . In particular, the preimage 
(in V) of every vertex of Q is a face of V . 

Therefore, if u G vert(iyp) , then the Wp-fiber of u is a face of V , and so such a fiber has itself 
integer vertices (because V does). Therefore, an extreme-point solution of the linear program 

max jc^x : Wx = u , x £ V} 

will be integer. □ 

The following result is not needed for our purposes, but it is interesting in its own right. 

Proposition 1 Assume that we have V — {x £ R" : Ax < b} , for some binary encoded A £ Z™^" 
and b £ Z™ . Let W £ Z"*^" and u £ vert(V1^7^) be binary encoded, Then we can compute an inequality 
description of the W-p-fiber of u as a face of V in polynomial time. 

Proof For each row I < i < m , we simply solve the linear program 

Zi :— min {Ai.x : Wx = u , Ax < b} . 

xGR" 

Then 



{W-^u)nV= |x G M" : A,.i 



— bi if Zi l^)- bi , for 1 < i < m 



□ 
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Next, we demonstrate that under an appropriate encoding of the data, the set of vertices of WV 
can be computed efficiently. 

Lemma 3 Let W £ Z"*^" be given. We assume that J- is a finite subset of , and further that 
T C S{n,(3) := {x G R" : l^a; < /?} , for some (3 G Z+ . Then, for fixed d , we can compute the 
vertices of WV in time polynomial in n , P , and the length of the generalized unary encoding of W . 

Proof For x £ J- , consider u :~ Wx . For 1 < i < d , we have 

:= W,.x = = EjLi (ELi '^M-flfe) J2Li{j2]=iSljXj^ak . 

Observe that the nonnegative integer coefficients X]j=i ^ij^j of the ak are bounded by a polynomial 
in the unary j3 and the unary (5f : Letting ui :— max \ Sfj | , we observe that 

E;=i'55x,|<||55||^||x||,<../3. 

This means that there are only a polynomial number of possible values for each Ui (p is fixed), and 
then (with d fixed) only a polynomial number of images u = Wx of x £ J- . 
Concretely, for fc = 1, . . . ,p , we let m'^ range over 

{0,±l,...,±u;/3}'* CZ"^ . 

For each of the (2w/3 + 1)^'' choices of (m-^, . . . , w^) , we let u := J2k=i ^ku'' ■ For each such u we check 
if M € WV using the algorithm of Lemma [H and so we obtain a set U C Z'^ , comprising these points 
u having a nonempty fiber, having a polynomial number of elements. 

Now, because U is contained in WV and contains vert(W^7') , it is clear that WV — conv([/) . 

Finally, because the dimension d is fixed, we can compute the vertices of conv([/) in polynomial 
time. □ 



1.3 Exact algorithm for quasiconvex maximization 

In this subsection, we take the facts that we have already built up to give an efficient algorithm to 
maximize a function that is quasiconvex (that is, its "lower level sets" {2; £ M'* : f{z) < /} are convex 
subsets of M.'^ , for all / G R ; see [T], for example). Of course, the algorithm also applies to ordinary 
convex functions. 

Theorem 1 Suppose that we are given W £ Z''^" , quasiconvex function f : W^' — > R specified by a 
comparison oracle, and a finite subset J- of Z" . We further assume that T C S(n, f3) := {x £ R" : 
l^x < P} , for some (3 £ Z+ , and that we have a separation oracle for V := conv{T) . Then, if d is 
fixed, we can solve P{J' , max, /, W) in time polynomial in n , (3 , and the size of the generalized unary 
encoding of W . 

Proof Because f{u) is quasiconvex on the polytope WV and fw{x) := f{Wx) is quasiconvex on the 
polytope V , and the maximum of a quasiconvex function on a polytope is attained at a vertex of the 
polytope, we have 

max{/(Wa;) : a; G vert(P)} = max{/(Wa;) : x£V} 

= max{/(u) : u £ WV} = max{/(u) : u £ veTt{WV)} . 

First, via Lemma [3l we compute efficiently vert(W^'P) . Next, using the comparison oracle for / , 
we can compare values of f{u) , for all u £ vert(M^7^) , to obtain the best u . Then, for the best 
u £ veTt{WV) , via LemmalU using an arbitrary linear objective vector c G Z" in general position, we 
efficiently find an extreme point a;" of the W-p-fiber of u . By Lemma [21 such a point x" will be integer 
and will thus solve P{J-, max, /, W) . □ 
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2 Approximation algorithms for norms and related functions 

In this section, we consider approximation algorithms as a means of relaxing the assumptions that gave 
us an efficient algorithm in the previous section. In tj2.11 we provide an efficient approximation algorithm 
for combinatorial norm maximization. In §2.2) we provide an efficient approximation algorithm for 
combinatorial minimization. This latter algorithm applies to a very broad generalization of norms. 



2.1 Approximation algorithm for norm maximization 

The following theorem provides an approximation algorithm for maximizing a norm / , that runs 
in time that is polynomial even in the bit size of the weights Wj^j and even if d is variable. The 
approximation ratio depends on the (positive) constants for the standard equivalence of the norm / 
with the infinity norm: 



calculated. Letting {ei, . . . , e^} denote the standard basis of K.'* , a valid choice of is the sum of 
/ evaluated on each of the basis elements (i.e., C/ := fi^i))- Furthermore, the best C/ is the 

minimum of the convex function / on the imit sphere in the oo-norm (i.e., C/ :— min{/(M) : ||u||oo = 
1} . This is not a convex optimization problem because we are restricted to the boundary of the (unit) 
ball (of the oo-norm). If however we take d to be constant or even say 0(log n) , then we can solve 
2*^ convex minimization problems (via say an ellipsoid method) , one for reach facet of the unit ball of 
the oo-norm, in order to efficiently calculate a valid (in fact, best) value of Cf . Finally, for the special 
case of f{u) := \\u\\p , our algorithm finds a ^^/''-approximate solution. 

Theorem 2 Suppose that we are given nonnegative W G '/^d.xn ^ jr ^ finite subset of . We 
further assume that J- C S{n,f3) :— {x G M" : l^a; < /?} , for some f3 € Z+ , and that we have 
a separation oracle for V := conv{J-) . Then, for any norm f given by a comparison oracle, there 
is an algorithm that determines a (^C-^ /Cf^ -approximate solution to P {J^ , max, f , W) , in time that 
is polynomial in d ,n , j3 , and max [logu;^^-] . Moreover, for f{u) :— \\u\\p , for any p satisfying 
1 < p < oo , our algorithm determines a d^^^ -approximate solution. 

Proof First, let / be an arbitrary norm. The algorithm is the following: For i = 1, . . . ,d , solve the 
linear-programming problem 



obtaining an optimal vertex x"^ oiV , and let be its VF- image. Then output x'' such that 



We now show that this provides the claimed approximation. Let s satisfy ||u*'||oo — max^^j^ 1 1 loo ■ 
First, we claim that any u £ vert(M^7') satisfies ||u||oo < ll'^^lloo ■ To see this, recall that W and 
hence the are nonnegative; choose any point x G (W~^u) n T (hence u = Wx), and let t satisfy 
Ut = 1 1 u I loo = maxf^i Ui . Then, as claimed, we get 

||w||oo = ut = Wt.i < msix{Wt.x : X e T} = T^t.x* = 4 < < Halloo ■ 

Let X* be an optimal solution, and let u* be its VF-image. As norms are (quasi) convex, we can 
without loss of generality take x* to be a vertex of V and u* to be a vertex of WV . 
Then we have the following inequalities: 




ma.x{Wi.x : x £ P} , 



maxti Ikil 



p 



fiWx*)^fiu*) < \\u*\\^ < Cf \\u'\\ 

< (Cf/Cf) f{u^) < (Cf/Cf) /(u'-) 



(cf/Cf) fiwxn . 



Finally, observing that for p satisfying 1 < p < oo , we have Cf 
that for p-norms we get a d^/^-approximate solution. 



1 and Cf = d^/P , we conclude 

□ 
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2.2 Approximation algorithm for ray-concave minimization 

In this subsection, using some of the methodology that we already developed, we obtain an efficient 
approximation algorithm for nonlinear combinatorial minimization. Our algorithm applies to certain 
functions that generalize norms via, what will at first seem paradoxical, a type of concavity. 
A function / ; R'^ — > M is ray- concave if 

(i) A/(u) < f{Xu) for M e M'[ , < A < 1 ; 
Analogously / : R.'^ ^ R is ray- convex if 
(i') A/(u) > f{Xu) for w e , < A < 1 ; 

Naturally, we say that / : M^J. — > M is homogeneous if it is both ray-concave and ray-convex. 
A function / : K^J: — > R is non- decreasing if 

(ii) ,f{u) < f{u) , for u, u G M'[ , u<u . 

We are mainly interested in ray-concave non-decreasing functions / : R^J_ R . In particular, by 
{i) with A = , we have /(O) > , and hence by {ii) , we have f{u) > for all u G R!^ . 
Notice that ordinary concavity of a function / has the special case: 

A/(ii) + (1 - A)/(0) < /(Am + (1 - A)0) , for w e R^ , < A < 1 , 

and so if / is concave with /(O) = , then it is ray-concave — thus justifying our terminology. 

Notice further that if / is a norm on R"* , then it of course non-decreasing on ; additionally, 
it is not only ray-concave, but it is more strongly homogeneous (and more of course). In general, 
homogeneity already implies /(O) = . A concrete example of a ray-convex function that is not 
generally homogeneous is h(u) :— ni=i • 

There are other interesting and useful functions that are ray-concave. Suppose that ray-concave 
5 : R^ -> R and ray-convex : R5{_ ^ R satisfy 

(iii) g{u + v) — g{u) > h{u + v) — h{u) , for all m, u G R'J_ . 

That is, g grows no slower than h on Rjj. . Then f{u) :— g{u) — h{u) is of course non-decreasing. 
Moreover, if g is ray-concave on R^J. and h is ray-convex on R^J. , then / is also ray-concave on R^ . 

As a concrete and natural example, consider giu) := ||it||i and h{u) := \\u\\p for any integer p>l 
or infinity. Then f{u) := g{u) — h{u) — \\u\\i — \\u\\p is ray-concave and non-decreasing on . Notice 
that already for d = 2 and p = oo , f{u) is not a norm — indeed, for this case f{u) = min(?ii, U2) . 

Theorem 3 Suppose that nonnegative W G Z"*^" is given, d is fixed, f : R'J_ M. is a ray- concave 
non- decreasing function given by a comparison oracle, and IF is a finite subset of Z" . We further 
assume that T C S{n,P) := {x G R" : l^cc < /3} , for some (3 G , and that we have a separation 
oracle for V '.— conv{J-) . Then we can compute a d-approximate solution to P{T ,XQhi, f ,W) in time 
polynomial in n , (3 , and the size of the generalized unary encoding ofW . Moreover, for f(u) := ||m||p 
(the p-norm), 1 < p < oo , the algorithm actually determines a d^/'^- approximate solution, where 
1/p + l/q ^ I (with obvious interpretations when a denominator is 00) . 

Proof Because we are minimizing a function that need not be concave, it may well be the case that 
the optimal solution of P{J-, min, /, W) is not realized at an extreme point of P and consequently, it 
may be that min{/(u) : u G WP} is not realized at an extreme point of WP . Nonetheless, we will 
focus on extreme points of WP . 

Apply the algorithm of Lemma[3]so as to construct the set of vertices oiWP . Using the comparison 
oracle of / , identify a vertex u G vert(M^7') attaining minimum value f(u) . Now apply the algorithm 
of Lemma [1] to m and, as guaranteed by Lemma [21 obtain an integer x in the fiber of u , so that 
u = Wx . Output the point x . 

We now show that this provides the claimed approximation. Let x* be an optimal integer point, 
and let u* :— Wx* be its image. Let u' be a point on the boundary of WP satisfying u' < u* . By 
Caratheodory's theorem (on a facet of WP containing u'), u' is a convex combination u' = J2i=i ^i^* 
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of some r < d vertices of WV for some coefficients A,; > with X]i=i — ^ ■ Let t be an index for 
which At = maxj{A.j} . Then At > i Ai = 1/r > l/d . 

Because W is nonnegative, we find that so are u' and the u' , and hence we obtain 

f{Wx) = f{u) < f{u') < dXffiu') < d-f{\tu') 

< d-f{Y:,=^Ku') = d-f{u') < d-fiu*) = d-fiWx*). 

This proves that x provides a d-approximate solution. 

Now consider the case of the p-norm f{u) = \\u\\p . First, we note that the first part already covers 
the case of p = oo . So we confine our attention to p < oo . We complete now the proof for 1 < p < oo , 
and then it is an easy observation that the proof still goes through (in a simplified form) establishing 
also that for p = 1 we get a 1-approximate (i.e., optimal) solution. 

By the Holder inequality (|a^6| < ||a||p||f'||q , for l/p+ l/q = 1), we have 

1 = ELii-A.<(EUi^)'^'(E:=iAr)^/'' = ^^/^ELiAD'^' < d^/" (EU aD'^^ . 

Find s with ||u*||p — mini{||M*||p} and recall that the are nonnegative. We then have 

nwx) = Ml < < rf^/MELiADIkll? < rf^/'E[=i(Aflkil^) 

< rfP/« A..u*||p = dP/'J\\u'\\P < dP^i\\u*\\P = dP^ifP{Wx*) , 

which proves that in this case, as claimed, x provides moreover a d^/^-approximate solution. □ 



3 Randomized algorithm for vectorial matroid intersection 

The algorithm of i)1.3l can be used to solve efficiently the problem P{J-', max, /, W) , when T is the set 
of characteristic vectors of sets that are common independent sets or bases of a pair of matroids on a 
common ground set, and / is convex. Such an algorithm only needs an independence oracle for each 
of the matroids. Even better, we can do likewise when is the set of integer points that are in a pair 
of integral polymatroids or associated base polytopes. For polymatroids, we only need an oracle that 
evaluates the associated submodular functions. 

In this section, we relax the assumption of / being convex. Indeed, we give a randomized algorithm 
for arbitrary / presented by a comparison oracle. By this we mean an algorithm that has access to a 
random bit generator, and on any input, it outputs the optimal solution with probability at least half. 
We restrict our attention to unary encoded W and T being the set of characteristic vectors of common 
bases of a pair of vectorial matroids Mi and M2 on the common ground set {1,2, ... ,n} . Our broad 
generalization of the standard linear-objective matroid intersection problem is defined as follows. 

Nonlinear Matroid Intersection. Given two matroids Mi and M2 on the common ground set 
N = {1, . . . ,n} , integer weight matrix W G Z''^" and an arbitrary function f : M.'^ ^ M. specified by 
a comparison oracle . Let be the set of characteristic vectors of common bases of Mi and M2 . Find 
an X € J- maximizing (or minimizing) f{Wx) . 

When Ml and M2 are vectorial matroids, they can be represented by a pair of r x n matrices. We 
assume, without loss of generality, that r is the common rank of Mi and M2 . Note that in this case T 
satisfies C /„ , where In,r is the set of vectors in R" with exactly r nonzero entries which are equal 
to 1. Thus our problem consists on finding an x that maximizes (minimizes) f{Wx) , where x is the 
characteristic vector corresponding to the set of column indices defining a common base. We assume 
the matroids have at least one common base which can be efficiently checked by the standard matroid 
intersection algorithm. 

By adding to each Wij a suitable positive integer v and replacing the function / by the function 
that maps each u G K'^ to /(ui — rv, . . . ,Ud — rv) if necessary, we may and will assume without loss of 
generality throughout this section that the given weight matrix is nonnegative, W € Z^[.^" . 

In this section we will be working with polynomials with integer coefficients in the n + d variables 
aj , j = 1, . . . ,n and bi , i = 1, . . . , d . Define a vector 7 in R" whose entries are monomials by 
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d 



i=i 

For each vectors x £ Z" and u G , the corresponding monomials are 

n d 

j=i i=i 

Let Aff and Mf be the r x r matrices comprising the r columns whose indices correspond to the 
nonzero entries in x . Finally, for each u G Z'J^ define the following polynomial in the variables a — (aj) 
only. 

gu{a) := ^ {det(Mn • det(Mf : a; G , Wx ^ u} . 

Define U :~ W T — {Wx : x G T} as the set of points that are the projection of T under the weight 
matrix W . Let Mi (7) be a matrix whose j-th column is M("fj . We then have the following identity 
in terms of the ^^(a) . 

n 

det(Mi(7)M2^) = det(Mf(7)).det(M|) = ^ det(Mf ) • det(M|) [] 7f 

n n d 

= J2 det(Mf ) . det(Mf ) J] 7^ = E det(Mf ) • det(M|) J] H ^^'^ 

n n d n d 

= Edet(Mf).det(M-)narnn^r-^'^^ = Edet(Mf).det(M2-)n«rn^r'-^ 

x£j^ j = l j = l 1=1 X<^J^ j=l 1=1 

= E 'iet(Mn ■ det(M|)a^5'^-^ = E E det(Mf ) • det(M|)a^6^-^ = ^ ■ 

Next we consider integer substitutions for the variables aj . Under such substitutions, each gu{o.) 
becomes an integer, and det(Mi(7)Af2^) — J2ueu 9'^(^^^^ becomes a polynomial in the variables b = 
(hi) only. Given such a substitution, let U (a) :— {u G U : gu{a) ^ 0} be the support of det(Mi(7)M2^) , 
that is, the set of exponents of monomial 6" appearing with nonzero coefficient in det(Afi(7)M2^) . 

The next proposition concerns substitutions of independent identical random variables uniformly 
distributed on {1, 2, . . . , s} , under which U{a) becomes a random subset U ^ U . 

Proposition 2 Suppose that independent uniformly- distributed random variables on {1, 2, . . . , s} are 
substituted for the aj , and let U := {u E U : gu{o) 7^ 0} be the random support 0/ det(Mi(7)Af2^) . 
Then, for every u G U = {Wx : x G J-} , the probability that u ^ U is at most r/s . 

Proof Consider any u G U , and consider gu{o,) as a polynomial in the variables a — (oj) . Because 
u = Wx for some x G J- , there is as least one term det(Aff ) • det(Mf in gu{a) ■ Because distinct 
X G J- give distinct monomials , no cancelations occur among the terms det(A'/f ) • det(Mf in 
gu(fl) ■ Thus, gu{a) is a nonzero polynomial of degree r . The claim now follows from a lemma of 
Schwartz [10^ stating that the substitution of independent uniformly-distributed random variable on 
{1, 2, . . . , s} into a nonzero multivariate polynomial of degree r is zero with probability at most r/s . 

a 

The next lemma establishes that, given aj , the support U{a) of det(Mi(7)M2^) is polynomial-time 
computable. 

Lemma 4 For every fixed d , there is an algorithm that, given W G Zjjl^" , and substitutions aj G 
{1, 2, . . . , s} , computes U(a) — {u gU : gu(a) 7^ 0} in time polynomial in n , max W^^j and [logs] . 
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Proof The proof is based on interpolation. For each u , let g„ := gu{a) be the fixed integer obtained 
by substituting the given integers aj . Let z rmax W^j and Z :— {0, 1, . . . , z}'^ . Note that z and 
\Z\ = (1 + zY are polynomial in n and the unary encoding of W . Then U{a) C U C Z , and hence 
det(Mi(7)M^) = J2uez 9ub" ^ polynomial in d variables b = (bi) involving at most \Z\ = {z + 1)'^ 

monomials. For t = 1,2, . . . , {z + l Y , consider the substitution 6j := t^-^"*"^^ , i = 1, . . . , d , of suitable 
points on the moment curve. Under this substitution of b and a given substitution of a , the matrix 
Mi{j{t))Mj becomes an integer matrix and so its determinant det(Mi(7(t))M2'^) becomes an integer 
number that can be computed in polynomial time by Gaussian elimination. So we obtain the following 
system of {z + I)'' equations in {z + I)'' variables <?„ , u G Z = {0, 1, . . . , 2}'' , 

d 

det(Mi(7(t))Mj) = ^5«n^r = E*^'=^"'^'^'^'"' t=l,2,...,{z + lf . 

uez fe=i uez 

As u = {ui, . . . ,Ud) runs through Z , the sum X]fc=i '^ki.z + 1)'^"^ attains precisely all \Z\ = + 1)'' 
distinct values 0, 1, . . . , (z + I)'' — 1 . This implies that, under the total order of the points w in Z by 
increasing value of '^^k^i '"fe(2 + 1)*^^^ , the vector of coefficients of the g„ in the equation corresponding 
to t is precisely the point (t", , . . . , t^-^-^^'> on the moment curve in ~ M(^+^) . Therefore, the 
equations are linearly independent, and hence the system can be solved for the Qu = gu{o) and the 
desired support 17(a) = {u G C/ : Quia) ^ 0} of det(Mi(7)M2) can indeed be computed in polynomial 
time. □ 

Next, we demonstrate that finding an optimal common base for a nonlinear matroid intersection 
problem can be reduced to finding an optimal W-image for a small number of subproblems. Consider 
data for a nonlinear matroid intersection problem, consisting of two matroids Mi and M2 on the 
common ground set N = {1, . . . ,n} , weight matrix W G Z''^" , and function / : R*^ ^ R . Each 
subset S C N gives a subproblem of nonlinear matroid intersection as follows. The matroids of the 
subproblem are the restriction of Mi and M2 to S ; that is, the matroid Mi.S, i = 1,2, on ground 
set S in which a subset / C S* is independent if and only if it is independent in Mi . Note that the 
restriction of the matrix representing Mi to the columns indexed by S is the matrix representation of 
Mi.S . Then, the subproblem is the nonlinear matroid intersection problem, consisting of two matroids 
Ml. 5 and M2.5 on the common ground set S , the new weight matrix is the restrictions of the original 
weight matrix to S , and the function / : R"^ — > R is the same as in the original problem. We have the 
following useful statement. 

Lemma 5 The nonlinear matroid intersection problem of finding an optimal common base of two 
matroids Mi and M2 is reducible in time polynomial in n to finding an optimal W -image for at most 
n+1 subproblems. 

Proof Denote by u*{S) the optimal W-image for the matroid intersection subproblem defined by the 
matroids Mi.S and M2.S' on the ground set S C N . Now compute an optimal common base for the 
original problem, by computing an optimal VF-image for n + 1 such subproblems as follows. 

Start with S := N; 

Compute an optimal W-image u* := u*{N) of the original problem; 
for j=l,2,. . . ,n do 
LetT:=5\{j}; 

Compute rank{Mi.T) and rank{M2.T); 

if rank{Mi.T) = r and rank{M2.T) = r then 

Compute an optimal T^-image u*{T); 

if f{u*{T)) > f{u*) then let S := T; 
end 
end 

return B := S; 



It is not hard to verify that the set B obtained is indeed an optimal common base for the original 
problem. □ 
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We are now in position to state our main theorem of this section. By a randomized algorithm that 
solves the nonhnear matroid intersection problem we mean an algorithm that has access to a random 
bit generator and on any input to the problem outputs a common base that is optimal with probability 
at least a half. The running time of the algorithm includes a count of the number of random bits used. 
Note that by repeatedly applying such an algorithm many times and picking the best common base, 
the probability of failure can be decreased arbitrarily; in particular, repeating it n times decreases the 
failure probability to as negligible a fraction as 1/2" while increasing the running time by a linear 
factor only. 

Theorem 4 For every fixed d , there is a randomized algorithm that, given an integer weight matrix 
W G Z"^^" , and a function f : M'' —> M presented by a comparison oracle, solves the nonlinear matroid 
intersection problem in time polynomial in n and max Wij . 

Proof As explained at the beginning of this section, we may and will assume that W is nonnegative. 
We claim that the following adaptation of the algorithm of Lemma [5] provides a common base which 
is optimal with probability at least 1/2 . Apply the algorithm of Lemma O but at each iteration 
apply the algorithm of Lemma [4] to determine a random optimal VF-image u*{T) which we claim to 
be the optimal ly-image u*{T) with probability at least 1 — l/(2(n+ 1)) . At each iteration, the 
algorithm either returns: a correct optimal W-image with probability at least 1 — 1/ (2(n + 1)) ; or a 
wrong one, or the computed U is empty with probability at most 1/ (2(n+ 1)) . In case that the U 
computed by the randomized algorithm is empty, we set u*{T) to be a virtual point with objective 
value /(m*(T)) = -oo . 

Now we show that the probability of success in each iteration, that is, of computing an optimal 
M^-image u*{T) for any T C iV in which rank{Mi.T) = rank{M2.T) = r , is at least 1 - 1/ (2(n + 1)) , 
as claimed before. Indeed, using polynomially many random bits, draw independent and uniformly- 
distributed integers from {1, 2, . . . , 2r(n + 1)} and substitute them for the Oj . Next compute a point 
u*(T) £ U attaining max{/(u) : u £ U} using the algorithm underlying Lemma |4] and a comparison 
oracle. By Proposition [21 with probability at least 1 — 1/ (2(n + 1)) , we have u*{T) G J7 , in which 
event max{/(M) : u G U} = max{/(u) : u S U} and then u*(T) is indeed an optimal M^-image. 

Therefore, the probability that the algorithm obtains a correct optimal ly-image at all iterations, 
and consequently outputs an optimal common base B of Mi and M2 , is at least the product over all 
iterations of the probability of success in each iteration, that is (1 — 1/ (2(n + l)))""^^ ■ Hence at least 
1/2 as desired. This completes the proof. 

□ 
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